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Quantum state discrimination is a fundamental primitive in quantum statistics where one has to 
correctly identify the state of a system that is in one of two possible known states. A programmable 
discrimination machine performs this task when the pair of possible states is not a priori known, 
but instead the two possible states are provided through two respective program ports. We study 
optimal programmable discrimination machines for general qubit states when several copies of states 
are available in the data or program ports. Two scenarios are considered: one in which the purity 
of the possible states is a priori known, and the fully universal one where the machine operates over 
generic mixed states of unknown purity. We find analytical results for both, the unambiguous and 
minimum error, discrimination strategies. This allows us to calculate the asymptotic performance 
of programmable discrimination machines when a large number of copies is provided, and to recover 
the standard state discrimination and state comparison values as different limiting cases. 
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I. INTRODUCTION 

Discrimination between given hypotheses is one the 
most basic tasks in our every day lives. Very often we 
are confronted with the necessity of having to identify 
an option between some possible choices based on some 
acquired evidence. In the quantum setting the discrim- 
ination problem consists of identifying one of two pos- 
sible states given a number of identical copies available 
for measurement. This task encompasses a plethora of 
non-trivial theoretical and experimental implications. In 
the usual setting the a priori states are known, i.e., the 
classical information characterizing the possible states is 
provided and the discrimination protocol is tailored for 
this specific information. One usually considers two types 
of approaches: unambiguous [l| and minimum error (2| 
discrimination. An unambiguous protocol is one where 
the identification of the state is error free. Of course, 
this is only possible stochastically, i.e, unless the states 
are orthogonal, the protocol must give an inconclusive 
answer (the "I do not know" outcome) with a non van- 
ishing probability. In the minimum error approach, the 
protocol always yields a definite answer, which may be 
wrong some of the times. An optimal protocol is one 
which minimizes the inconclusive or the error probabil- 
ity. It may also be possible to go continuously from one 
case to the other by considering margins of error proba- 
bilities Q . In spite of being such a fundamental problem, 
only very recently a closed expression for the asymptotic 
error probability has been obtained (see J^-^ and refer- 
ences therein), the quantum Chernoff bound, from which 
metric distances and state densities [7] can be derived. 

Very much in the spirit of universal computers, it is in- 
teresting to consider discrimination devices that are not 
specialized in a specific discrimination instance but can 
discriminate between arbitrary pairs of states [M Q • In 
these, the set of possible states enter the device as "pro- 
grams", i.e., the classical description of the states is not 
provided beforehand, rather the information is incorpo- 



rated in a quantum way (this can also be viewed as an 
instance of relative information [lOl)- These devices have 
program ports that are loaded with the program states, 
and a data port that is loaded with the unknown input 
state one wishes to identify. The device will identify the 
state of the data port as being one of the states fed in 
the program ports, but this identification will in general 
be erroneous with a probability that decreases with the 
number of copies of the states entering the ports. One 
can also regard these devices as learning machines jll|. 
where the device is instructed through the program ports 
about different states, and based on this knowledge the 
machine associates the state in the data port with one of 
the states belonging to the training set. Increasing the 
number of copies of states at the program and data ports 
of course increases the chances of correct identification. 

It is particularly relevant to understand how the prob- 
ability of error scales with an increasing number of copies 
and what are the corresponding error rates. The value 
of this rate is one of the most relevant parameters as- 
sessing the performance of the device. We will consider 
the discrimination of two general qubit states, although 
most of our results can be generalized to higher dimen- 
sional systems (see [IJ, Il3 for a single copy continuous 
variable setting). For simplicity we will assume that the 
prior occurrence probability of each state is identical and 
compute the unambiguous and minimum error rates for 
optimal programmable devices. 

Wc first study the performance of such devices for pure 
states. We compute the error probabilities for any num- 
ber of pure qubit states at the input ports. Some of the 
results are already available in the literature 0, [l4| - [l8| , 
but the way we formalize the problem here is crucial to 
treat the more general mixed state case. In addition we 
obtain analytical expressions that enable us to present 
the results and study limiting cases in a unified way. 
In particular, when the program ports are loaded with 
an infinitely large number of copies of the states we re- 
cover the usual state discrimination problem Q], since 



it is clear that then one has the classical information 
determining the states entering the program ports. On 
the other hand, when the number of copies at the data 
port is infinitely large, while the number of copies at the 
program ports are kept finite, we recover the state com- 
parison problem [13, [201 ■ 

We extend the previous pure state study to the case of 
mixed input states. In this scenario we only compute the 
minimum error probability, as no unambiguous answers 
can be given if the states have the same support 2l| . The 
performance of the device for a given purity of the input 
states allows to quantify how the discrimination power is 
degraded in the presence of noise. The expressions here 
are much more involved, however one can still exploit 
the permutation symmetry of the input states to write 
the problem in a block-diagonal form [22|, [23| . We then 
obtain closed expressions for the probability of error that 
can be computed analytically for small number of copies 
and numerically evaluated for a fairly large number of 
copies. We are also able to obtain analytical expressions 
for some asymptotic rates. Again, the leading term, as in 
the pure state case, is seen to coincide with the average 
minimum error for known states. 

We also analyze the fully universal discrimination ma- 
chine, i.e., a device that works optimally for completely 
unknown input states. In this case one has to assume 
a uniform distribution for the purity. In contrast to the 
pure state distribution, there is no unique choice Q , and 
different reasonable assumptions lead to different uniform 
priors. Here we consider hard-sphere, Bures and Chernoff 
priors. 

The paper is organized as follows. In the next Section 
we obtain the error probabilities for pure states when 
each program port is fed with n copies of each state and 
there are m copies of the unknown state entering the data 
port. In Sec mil we study the asymptotic rates in several 
scenarios. In Section IIVI we analyze the performance of 
these devices when the ports are loaded with copies of 
states of known purity and obtain some interesting limit- 
ing cases in Sec. |Vl We finally obtain the error rates for 
the fully universal programmable machine. Some brief 
conclusions follow and we end up with two technical ap- 
pendices. 



II. PURE STATES 

Let us start by fixing the notation and conventions 
used throughout this paper. We label the two program 
ports by A and C. They will be loaded with states {ipi) 
and |'02), respectively. The data port, B, is the mid- 
dle one and will be loaded with the states we wish to 
identify. We also use the short hand notation [ip] to de- 
note |i/')('0| and similarly [ipip . . .] — ["0] <8) [0] (8) • • • = 
IV'XV'I ^ \4'){4>\ <8' • • • • We may also omit the subscripts 
A, B and C when no confusion arises. We assume that 
the program ports are fed with n copies of each state and 
the data port with m copies of the unknown state. This 



is a rather general case for which closed expressions of 
the error probabilities can be given. The case with arbi- 
trary nA,nB^ and nc copies at each port is discussed in 
appendix [X] The expressions are more involved but the 
techniques are a straightforward extension of the ones 
presented here. 

When the state at the data port is j-^i) ™ or |'02) , 
the effective states entering the machine are given by the 
averages 
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respectively. The integrals can be easily computed using 
Schur lemma, J d(t)[(t)\x = t~1x, where dx is the dimen- 
sion of the Hilbert space spanned by {\(t>)} and Ix is the 
projector onto this space. Hence 
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where Ti-XY is the projector onto the completely sym- 
metric subspace of Hx (8> "Hy and dxY — tilxY is its 
dimension. For qubits we have dA = dc = n + 1 and 
dAB = due = n + m+l. 

The structure of the states ([2]) suggests the use of the 
angular momentum basis, \JAiiB{JAB)iic';J^i) for cti 
and \iA,JB,jc{JBc)] JM) for 02- The quantum numbers 
JAB = JA +JB and JBc = JB +jc recall the way the three 
spins are coupled to give the total angular momentum J. 
Here the angular momenta have a fixed value determined 
by the number of copies at the ports, ja = jc = 'n'/2, 
JB = m/2. So, we can very much ease the notation by 
only writing explicitly the labels jab and jbc- We would 
like to stress however that in general one needs to keep 
track of all the quantum numbers, specially when dealing 
with mixed states as in Sec. IIVI 

In cTi the first n + m spins are coupled in a symmetric 
way, while in (72 the symmetrized spins are the last n+m, 
thus JAB — (n + m)/2 = Jbc- The states are diagonal 
in the angular momentum bases discussed above, and we 
have 
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where the lower limit of the first summation takes the 
value (1/2) for m even (odd). Notice that the spectnmi 
of both matrices is identical and that the basis elements 
of their support differ only in the way the three spins are 
coupled. Further, the key feature of the total angular 
momentum bases is the orthogonality relation 

{Jab; JM\jbc; J'M') = V J ^ J'ovM ^ M'. (4) 



Bases of this type are known as Jordan bases of sub- 
spaces [ij]. Since a state of the first basis (labeled by 
jab) has overlap with only one state of the second basis 
(labeled by Jbc)i the problem is reduced to a discrimina- 
tion instance between pairs of pure states. Then the total 
error probability is simply the sum of the contributions 
of each pair. 

In the unambiguous approach, the minimum probabil- 
ity of an inconclusive result for a pair of states \(f)i) , \(f)2) 

I 



with equal priors is simply P'^{\(j)i} , \(f>2)) = |((/'i|'/'2)| 0, 
hence 

P' - -r^ E I O'^s ; -^^'^ b'sc ; JM) I . (5) 



dAsdc 



JM 



These overlaps can be computed in terms of the Wigner 
6-j symbols [24| : 
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and they are independent of M [2j|, therefore in what 
follows we omit writing the quantum number M, and we 
perform the sum over Af in ([5]) trivially by adding the 
multiplicative factor 2J+1. Substituting the value of the 
6j symbols for ja == jc = n/2, jb = m/2, jab = Jbc = 
(n + m)/2, and setting J ~ m/2 + k we obtain 



{Jab;J\Jbc\J) = 



m 
-k 



(7) 



with k — 0, 1, . . . , n (observe that J takes values from 
J — n + m/2 of the totally symmetric space down to 
J = m/2). 

Plugging the overlaps Eq. ([7]) into Eq. ([S]), we obtain 
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where notice that the dimension of the subspace of total 
angular momentum J is to, -I- 2fc -I- 1 and in the second 
equality we have used the binomial sums 



E 

n 

E^ 

fc=0 



m -\- k 

TO 

?77, + fc 
TO, 



n + m + 1 
m + 1 

n + TO, + 1\ n{m + 1) 

TTl + 1 



(9) 



In the minimum error approach no inconclusive results 
are allowed, but the machine is permitted to give wrong 
answers with some probability that one tries to minimize. 
This minimum error probability can be computed along 
the same lines as in the previous case. Recall that the 
error probability P^^ for two pure states |(/)i) , |02) and 
equal a priori probabilities is 
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The total error probability is just the sum of the con- 
tribution of each pair of states with the same quantum 
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numbers JM, {\jab\ JM) , \jbc\ JM)}, 



P 



ME 



-M^-t 



m + 2k + l 



fe=0 



{n + l)(n + TO + 1) 



X Wl- 



(to, -|- ky.nV 
(to, + n)lkl ^ 



.(11) 



It is instructive to obtain the well-known results when 
the ports are loaded with just one copy of each state Q, 
i.e., n = m, = 1. 

The inconclusive probability in the unambiguous ap- 
proach reads 

3/2 

P^ = - E i2J+mJAB^i;J\JBc^i;J)\ 
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(12) 



i.e., five out of six times the machine gives an inconclu- 
sive result and only 1/6 of the times identifies the state 
without error. Notice that the overlaps for J ~ 3/2 are 
one. This must be so since J — 3/2 corresponds to the 
totally symmetric subspace, which is independent of the 
way the spins coupled. That is, this subspace is identi- 
cal for (Ti and (72. This is the main source of error as 
it contributes 4/6 — 4/6 x 1 out of the total 5/6 error 
probability. The remaining 1/6 = 2/6 x 1/2 is the con- 
tribution of the J = 1/2 subspace, where the 2/6 is the 
probability of having an outcome on this subspace and 
1/2 is the overlap between the states [cf. Eq. ([7])]. 

The minimum error probability in the one copy case 
reads 
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which using Eq. ([7]) or directly Eq. pT|) gives 
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That is, approximately 1/3 of the times the outcome of 
the machine will be incorrect. 

The error probability in both, minimum error and un- 
ambiguous approaches, will of course decrease when using 
more copies of the states at the ports of the discrimina- 
tion machine. Equations ([5]) and (|lip give the unambigu- 
ous and minimum error probability for arbitrary values 
of n and m. They enable us to study the behaviour of 
the machine for a large number of copies in the program 
and the data ports, which is what we next discuss. 



where we have defined x = k/n and used the 
Euler-McLaurin summation formula at leading order 

ELo /(^) - ^lo dxf{nx). 

This result could be easily anticipated from the min- 
imum error probability with classical knowledge of the 
pure states. Recall that the minimum error probability 
given m identical copies is 
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and we just have to compute the average for all pairs of 
the above expression. Using |(V'i|V'2)P = cos^ 0/2, where 
9 is the relative angle between the Bloch vectors of the 
two states, one has 



III. ASYMPTOTIC LIMITS FOR PURE STATES 

Let us start by considering the case of an asymp- 
totically large number of copies at the program ports 
(n — )■ oo) while keeping finite the number of copies m at 
the data port. For the unambiguous discrimination one 
obtains from Eq. © 
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We wish to show that in this limit the programmable 
machine has a performance that is equivalent to a pro- 
tocol consisting in first estimating the states and then 
doing a discrimination of known states. The average of 
the inconclusive probability of this protocol over all in- 
put states should coincide with Eq. ([15]). Recall that for 
known \ipi) and IV'2) states, when a number m of copies 
of the unknown state is given, this probability reads 



p'-(V'i,V'2) = i(V'iiV'2)r 



(16) 



One can do an explicit calculation of the average 
(P-(?/'i,V'2)) = 1/2 /J" sine cos™ 61/2, but it is amusing 
to obtain it in a very simple way from the Schur lemma 
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where d,„/2 is the dimension of the symmetric space of 
m/2 qubits (notice that stricto sensu this procedure is 
only valid for m even). Plugging this average into (J16p 
one immediately recovers Eq. (J15p . 

Now we turn our attention to the minimum error prob- 
ability. Taking n — >■ 00 and using the Stirling approxi- 
z^e~^yj2iTz in Eq. ([TT|). one obtains 
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(20) 

and performing the change of variables x = sin 9/2 this 
equation is cast exactly in the form of ([T5|) . 

What can not be anticipated is the next order 0(l/n), 
which gives the very relevant information on how fast the 
protocol reaches the asymptotic value (dHJ. A lengthy, 
but rather straightforward, calculation yields the remark- 
able result that this term has a coefficient which coincides 
with the value of the integral L dxx\/l — x^™. At this 
order we therefore can write 
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We now analyze the complementary case, that is, when 
the number of copies at the data port is infinitely large, 
TO — )> 00, while the number n of copies at the program 
ports is kept finite. In this limit we have perfect knowl- 
edge of the data state, |V'), but we do not know to which 
program port it should be associated. Observe that this 
situation is very much the same as state comparison jl9| . 

In this scenario the inconclusive probability in the un- 
ambiguous approach reads from ([5]) 
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Let us see that this agrees with the average performance 
of standard state comparison. If the data state is the 
same as the program state in the upper or lower port, 
the effective states to be discriminated are 
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respectively, where (i„ = n + 1 is the dimension of the 
symmetric space of n-qubits and 1„ is the projector onto 
this subspace. The minimal inconclusive probability for 
these two states can be obtained with a POVM whose 
elements are {Ei = [^®"] ® [V'®"]^,£^2 = [V'®"]'^ ® 



[^®"],£;7 = l(S)t- El- E2}, where [ij®'']-^ = 1„-[V'®"], 
that is, with a POVM that checks in each register wether 
the state is | ip) or not . Then the inconclusive probabihty 
reads 



P- (V') = - (trEfai + trE'vCTa) = 

2 n + 
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(24) 



independently of the state jf/'), where we have used 
tr[V'®"]^ = tr(l„ - [V"^"]) = n. 

The minimum error probability in this limit can be 
tackled in a similar fashion. The asymptotic expression 
of Eq. ([TT|) , though not as direct as in the unambiguous 
case, is rather straightforward to obtain. Notice that the 
dominant factor in the term containing factorials inside 
the square root is m~^^"~'^-'. So, we can effectively re- 
place the square root term by one, for all k < n. Taking 
into account that for k ~ n the square root vanishes, we 
have 
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The minimum error probability of a strategy that first 
estimates perfectly the input state and then tries to asso- 
ciate the correct label to it is given by Helstrom formula 
for cTi and (72 [2] 
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where ||yl|| = tr\Mtyl is the trace-norm of operator A. 
Substituting the expression of the states p3|) we obtain 



P 



ME 



2(n+l)' 



V'^"] ® w 



§ni_L 



1- 



2(n+l) 



[V'®"]-L®[V;®"]|| 



||[^0n]^[^»n]^|| 



ly 2(n-t-l)' 



(27) 



where in the first equality we have subtracted the com- 
mon term [-0®"] (g) [ip^"^] from both states, in the second 
we have used the orthogonality of the operators and in 
the last equality we use tr[-0®"]-'- = n . As expected, the 
result is again independent of \^-)). 

To end this section we compute the asymptotic error 
probabilities for the symmetric case, that is, when all 
the ports are loaded with the same m — n (and large) 
number of copies. 

In the unambiguous approach when n = tti — > 00 the 
first nonvanishing order of (|8]) reads 
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To compute the minimum error probability, it is con- 
venient to write Eq. (|lip for n = tti as 
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where 



and 
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We first observe that Ck is a monotonically increasing 
function and hence it takes its maximum value at fc = n. 
Second, we note that around this point 
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where H{x) — —a; In a; — (1 — a;) ln(l — x) is the Shannon 
entropy of a binary random variable and we have used 
that k K, n and iJ (1/2) = 1. Similarly, one has 
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and hence c^ ~ 2 (" ^\ With this, the probability of 
error in this limit reads 



(34) 



Finally, we perform the change of variables /c — > n — fc 
and use that in Eq. (15111) Pn~k — 3/(2ri) for fc ~ to 
obtain 




where we have defined the function 
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which converges very quickly to its exact value (the first 
four terms already give a value that differ in less than 
10^'^ from the exact value). 



IV. MIXED STATES 

We now move to the case when the program and data 
ports are loaded with mixed states. This situation arises 
when, e.g., there are imperfections in the preparation or 
noise in the transmission of the states. It is reasonable 
to suppose that these imperfections have the same effect 
on all states, i.e. to consider that the states have all the 
same purity r. The input states are then tensor products 
of 
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where n^ is a unitary vector and a = {a^, <Jy,<Jz) are the 
usual Pauh matrices. In what follows we assume that 
only the purity is known, i.e. one knows the character- 
istics of the noise affecting the states, but nothing else. 
This means that the averages will be performed over the 
isotropic Haar measure of the S^ sphere, like for pure 
states. At the end of this section we also analyze the 
performance of a fully universal discrimination machine, 
that is, when not even the purity is considered to be 
known. 

Notice that mixed states can only be unambiguously 
discriminated if they have different supports [21| , which 
is not the case when the ports are loaded with copies 
of the states ([57]) as they are full-rank matrices. There- 
fore, only the minimum error discrimination approach 
will be analyzed here. It is worth stressing that the com- 
putation of the optimal error probability in the multi- 
copy case is very non-trivial, even for known qubit mixed 
states. Only recently feasible methods for computing the 
minimum error probability for a rather large number of 
copies have been developed and the asymptotic expres- 
sion of this probability has been obtained [^, 161]. The 
main difficulty can be traced back to the computation 
of the trace-norm [see Eg. ipS)) ] of large matrices. The 
dimension of the matrices grows exponentially with the 
total number of copies entering the machine, and for a 
relative small number of them the problem becomes un- 
manageable. However, as it will be clear, it is possible 
to exploit the permutation symmetry of the input states 
to write them in block-diagonal form [23, l2j| crucially 
reducing the complexity of the problem. 

The two effective states we have to discriminate are 

ai = / dn^dn2pTA ® pTb <» pfc 

(72 = /" dnidn2pf'^ (^ p% ^ pfS, (38) 

where dui — df2i/(47r) is the invariant measure on the 
two-sphere. Any state having permutation invariance, as 
e.g. p®", can be written in a block diagonal form using 
the irreducible representations of the symmetric group 
5"^. Each block is specified by the total angular mo- 
mentum j and a label a that distinguishes the different 
equivalent representations for a given j 



That is a = 1, . . . , i^". For each block we have |25 
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the number of equivalent representations for each j 
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which, of course, is the same for all equivalent irreducible 
representations, i.e., independent on the label a. We 
sketch here the origin of the factors appearing in (|4T|) 
(full details can be found in [231] ) . The first factor comes 
from the contribution from the n/2 — j singlets present 
in a representation j made up of n spin- 1/2 states. The 
summation term is the trace of the projection of the re- 
maining states in the symmetric subspace with total an- 
gular momentum j, where we can use the rotational in- 
variance of the trace to write each state in diagonal form 




l+r 
2 




This term simply reads 
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Very much in the same way as it happened in previous 
sections, the only difference between the diagonal basis 
of CTi and (72 is the ordering of the angular momenta 
couplings. In ai we first couple subspaces A and B and 
obtain 



PAB 



dnipf2 (g> pf 
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(,AB 



'1. 



where 



^Ub = J2 \UbMab){UbMab\ 

Mas 



(44) 



(45) 



is the projector onto the subspace with quantum num- 
bers S,AB = {JA,a.A,JBjaB,JAB} and C"^"'^'" is defined 
in Eq. gl]). Notice that C"^"'^' depends only on the pu- 
rity of the state and on the total angular momentum 
JAB- Notice also that the tensor product of a mixed 
state has projections in all subspaces and the blocks are 
not uniquely determined by the value of jab, i-e., one 
has to keep track of the labels ja and js as well. Of 
course, subspaces with different quantum numbers S,ab 
are orthogonal, i.e., tr[l5lj/] = (5^^'trl^. When couphng 
the third system one plainly adds the quantum numbers 

Cc = {jc,ac}- 

In the notation we have developed so far, the diagonal 
bases of fxi and (T2 are written as Bi = {\(,ab£,c]JM)} 



and B2 = {\^a£.bc'iJM)}, respectively. Obviously, each 
set contains 2^"+™ orthonormal states and Eq. ([55)) reads 

^^=11 llCtC]:r\U^BC,JM]. (46) 

^a£.bc JM 

We just have to compute the minimum error from the 
Helstrom formula (f26| for these two states. It is conve- 
nient to define the trace-norm term 



so that 



T = \\ai - 0-2! 



pME ^}_ fi_}_rp 



(47) 
(48) 



To compute T we need to know the unitary matrix A that 
transforms B2 into Si or vice versa. The elements of this 
unitary are given by the overlaps between the elements 
of both basis {£,ab^c]JM\^'^Cbci J'^^')- We observe 
that these overlaps are non- vanishing only if jx = j'x , 
ax = a'x (X = A, B, C) and J = J', M = M'. Further- 
more, as mentioned above, their value does not depend on 
M or ax, thus, sums over these quantum numbers sim- 
ply amount to introduce the corresponding multiplicative 
factors. Therefore, it is useful to introduce a label con- 
taining the quantum numbers that determine the orthog- 
onal blocks in Bi and B2 that may have non vanishing 
overlaps, ^ = {JA,JB,jCi J} Sind the corresponding mul- 
tiplicative factor 

7C = <-,"-;^(2J+l), (49) 

where ly" is given in Eq. (|40l) . Eq. (|Tf| then reads 

where the explicit expressions of the matrix elements are 






\"l )JabJ'ab~ 3ab3'ab^Jab ^jc 



'jBCfBC ~ "IBCJbC ]A^]BC 



\^2 JjBCJ'nn ~ °J 



and 



41l,,Bc = {£.^3ab\^,3bc)- 



(51) 



(52) 



Recall that the overlap (|52p is independent of the quan- 
tum number labelling the equivalent representations (re- 
call also that it is independent of M) and therefore is 
given by Eq. ([5]). 

The computation of the minimum error probability re- 
duces to a sum of trace-norms of small size Helstrom 
matrices that have dimensions of the allowed values of 
JAB and JBC for given £, = {JaJbJc, J}- Hence 
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ME 



IE-,tJ 



(53) 



and this computation can be done very efficiently. 

We would like to show the analytical results for the 
simplest case of having just one state at each port, i.e. 
when n = TO = 1. In this situation we have fixed values 
JA = JB = jc = 1/2, the total angular momentum can 
be J = 3/2, 1/2 and jab = 1,0 (and similarly for jbc)- 
Here there is no degeneracy, the number of equivalent 
representations defined in Eq. (j40p is one, and therefore 
the multiplicative factor (H^ simply reads 75 = 2J + 1. 
The only relevant quantum number in this case is ^ = J, 
as all the other are fixed, and we do not need to write 
them explicitly. The minimum error probability is then 



pME _ 



i(l-i f:(2J-Hl)|ki'^^-A(^)4^^A(^)"||) 



J=l/2 



(54) 

The term of the sum corresponding to J = 3/2 van- 
ishes since it corresponds to the projection of ai^2 onto 
the completely symmetric subspace, which is identical for 



(3/2) 



J3/2) 



both states. Indeed, in this subspace tri' 
C^Cl/2 = (3 + ''^)/24, where we have used Eq. (gS]), and 
from Eq. ([52]) we obtain A^'^'^' = 1. In the subspace 
J — 1/2 we have 



Jl/2) 
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^1^1/2 
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and 
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(55) 



(56) 



Plugging these expressions into (j54l) we obtain the mini- 
mum error probability of one copy state 



pME = 1(1 



271/ 



(57) 



As expected, when r — >■ 1 we recover the pure state value 

Numerical results of the minimum error probability as 
a function of the purity of the input states for the sym- 
metric case n — m are depicted in Fig. [T] One sees that 
for low values of n (n < 3) the dependence on the purity 
is not very marked, the curves are concave almost in the 
whole range of the purity. For larger n however there is 
an interval of purities where the behavior changes quite 
significantly. For, e.g., n — 29 the inflection point oc- 
curs at r w 0.3. At very large values of n one expects 
a step-like shape with an inflection point approaching 
r = because the probability of error remains very small 
for r ^ and is strictly 1/2 at r = 0. The shape of 
the curves is explained by the existence of two distinct 
regimes. For high purities the probability of error is well 
fltted by a linear function in the inverse of the number of 
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FIG. 1. (color online) Error probability p^^ for n = m — 3 
(blue dashed line), 11 (green circles) and 29 (yellow squares) 
versus purity. The fit P^^ ~ 0.882/(nr-^) in the regime of 
high purities for n = 11 and n — 29 and the Gaussian ap- 
proximation p^^ ~ l/2exp[— nr^/(2\/3)] in the regime of 
low purities for all cases is represented (solid lines). 



FIG. 2. (color online) Error probability P^"^ for r = 0.2 
(blue thin solid line), r — 0.5 (brown dashed line), r = 0.7 
(green dotted line) and r = 1 (red dot-dashed line) versus 
n (n = m is assumed). Numerical points have been joined 
for an easier visualization. The approximation 0.882/(nr^) is 
also represented (thin solid lines). 



copies. We get P^^ ~ OM/{nr^) where the value 0.88 
coincides with the analytical value computed for pure 
states Eq. (|35p. Of course, this approximation cannot be 
valid for low purities. In this range of low purity the min- 
imum error probability is very well approximated by the 
Gaussian function p^^ ~ l/2exp[— nr^/(2-\/3)], where 
we have taken the argument of the exponential from the 
exponentiation of the exact 1x1x1 case ([57| . This ap- 
proximation works for purities in the interval of the width 
of the Gaussian, i.e., up to '-^ 1/y/n. Therefore, as n in- 
creases the asymptotic approximation p^^ oc l/(nr^) 
extends its validity to almost the whole range of purities, 
and the expected jump discontinuity develops in r = as 
n — ^ oo. Similar information is depicted in Fig. [21 where 
the error probability is plotted as function of the number 
of copies n for different purities. We have superimposed 
the asymptotic result, which is seen to yield a very good 
approximation to the exact error probability already for 
n > 20. 



V. ASYMPTOTIC n x 1 x n 

As in previous sections, it is interesting to study the 
performance of the machine in the asymptotic regimes. 
A particularly important instance where it is possible to 
obtain closed expressions is the case when the number 
of copies at the program ports is asymptotically large 
and there is one state at the data port. We show how to 
compute the leading order and sketch the generalizations 
needed to obtain the subleading term. 

Observe first that jab can only take the values jab — 
JA ± 1/2 and similarly for jbc- Therefore all ^^^ 2x2 
matrices (except in the extremal case of J = JA+jc + ^/'^ 
which is one-dimensional). It is useful to write 



with 



R±{j) 



S±l/2 



With this definition one simply has [see Eg. dSTI) ]: 
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We further notice that for large n 
^ ^ ^ n/2 + j + l 2r 



o-(jc)- 



(59) 



(60) 



n7r(l 



■ exp 



{2j/n - rY 
'' 2(1 -r2) 



.(61) 



Defining y = 2j /n and using the Euler-Maclaurin sum- 
mation formula, we have for a generic function f{j) 



E<c7/(j) 



l + r 
2r 



dyGnjy) ,ny 

n/2 + ny/2 + V^ 2 ' 



(62) 

where we have extended limits of integration from (0,1) 

to (—00,00), which is legitimate for large n, and defined 



Gn{y) = 



27r(l-r2) 



exp 



(2/ " rf 



2(1 -r2) 



(63) 



a Gaussian distribution centered at y = r with variance 
cr^ = (1 — r^)/n. Notice that at leading order, n — > 00, 
Goo ~ 5{y — r) and hence 

3 

Notice also that at this order 



nr^^ 2' 



(64) 



^i^) = C-.G-J"^^^'^ ^0^.)^ 



(58) 



„ / ^ „ ,nr ^ 1 ± r 



(65) 
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FIG. 3. (color online) Error probability P^^ for n = 20 
(yellow circles) and n = 79 (green squares) versus purity. 
The asymptotic behaviour given by Eq. (|70p is represented 
for both cases. 



There only remains to compute the unitary matrix 
Eq. (|52|) . Observe that the total angular momen- 
tum takes values J = \ja — Jc| + 1/2 + fc with k — 
0, 1, . . . , 2 iTnii{j A, jc}- The leading order is rather easy 
to write (the subleading term, although straightforward, 
is far more involved, and we will not show it here). At 
this order we have J = 1/2 + k and fc = 0, 1, . . . , nr and 
the matrix elements computed from ^ yield 



1 



k 



nr \\/{nr)^ — fc^ 






Plugging Eqs. (|58ll66l) into Eq. ([50)) one gets 

2 



^2fc 

fc=0 



:^/(nrf 



fc2 



(66) 



(67) 



where the sum over ja and jc has been trivially per- 
formed by substituting their central value nr/2 in the 
summand and the only remaining multiplicative of 7^ [cf. 
Eq. (gni] is 2 J -f 1 ~ 2k. Finally, defining x = k/nr and 
using the Euler Mac-Laurin approximation as in Eq. (jlSp 
we obtain 



T: 



4r / dxxy/l 
'0 



4r 



and hence 



P 



ME 



(68) 



(69) 



which obviously coincides with the pure state result 
Eq. (dH) for m = 1 and r -> 1. 

As for the computation of the next-to-leading order, 
the integrals approximating the sums over ja and jc have 
to incorporate the fluctuations around the central value, 
i.e., one defines ja = f (?" + Va) and jc = f (r + rjc), 
where the variables rjx have effective dimension n^^'^. 
Then one can expand the matrix elements of <ti,2, A, 
and the terms of i/" present in Eq. ([5^ taking into ac- 
count the effective dimensionality of all the terms [notice 
that k — >■ n{r + ri)x, where the integration range of x is 



(0, 1)] . One then performs the sum in k by means of 
the Euler-Maclaurin summation formula as before. Fi- 
nally one computes the integration in Ja/b taking into 
account that range of the variables tja/b can be taken to 
be (— cx),cx)). After a somewhat lengthy calculation we 
obtain 



iME 



1 

3nr 



(70) 



Notice that the limit r = is singular and not surpris- 
ingly the expansion breaks down for purities of order 1/n. 
As it should, the error probability ([70)1 increases mono- 
tonically with the purity. 

In Fig. [3] we plot the error probability as a function 
of the purity for n = 20 and n = 79. One sees that 
the asymptotic expression (j70p approximates very well 
the minimum error probability even for small number of 
copies . For larger n, e.g., for n = 79 the approximation 
works extremely well down to values below r = 0.3. 

We finish this section by showing that the leading term 
(p5)) coincides with the average error of a device that first 
estimates the mixed states at the program ports and af- 
terwards does the usual minimum error discrimination of 
the data state. From the Helstrom formula I^E^ particu- 
larized for mixed qubit states one has 
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ME 
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In - r2 



(71) 



where the average is taken over all possible orientations 
of the Bloch vectors r^ and r2 • For equal purity states it 
simply reads 
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ME 



1 / r 
2^-2 



d6l sin 6* sin 6'/2 



1 ?■ , ^ 
2-3- (^2) 



FULLY UNIVERSAL DISCRIMINATION 
MACHINE 



Let us finally address the fully universal discrimina- 
tion machine, i.e., a machine that distinguishes states 
from which nothing is assumed to be known, not even 
its purity. For this type of machine, we need to specify 
a prior distribution for the purity. While the isotropy 
of the angular variables yields a unique uniform distri- 
bution for the angular variables, the Haar measure on 
the 2-sphere used in previous sections, the corresponding 
expression for a fully unbiased distribution of the purity 
w{r) is not uniquely determined. This is a longstanding 
issue, and several priors haven been suggested depending 
on the assumptions made [3, [2a]. Here we will not stick to 
a particular distribution, rather we will show results for 
three reasonable distributions. The actual values of the 
the probability of error may depend on the chosen prior, 
but the overall performance is seen to be very similar. 

The most straightforward, but perhaps not very well 
grounded, choice is that of the distribution of a hard- 
sphere w{r) ex r^, i.e., a normalized integration measure 
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FIG. 4. (color online) Error probability p^^ for hard-sphere 
(green solid line), Bures (blue dotted line) and Chernoff (red 
dashed line) priors versus n (n = m is assumed). The points 
correspond to the error probability for a fixed r = 0.9; its 
proximity to the Chernoff curve exposes the fact that this 
prior gives larger weights to states of high purity. 
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we can use rotational invariance in the same fashion as 
in Eqs. (pi|) and (|46l) . Therefore, here it is only required 
to compute the average of the coefficients C" in Eq. (|1T|) 
according to priors (J73II75I) . To calculate the minimum 
error probability of this fully universal machine one sim- 
ply uses Eq. (|53l) for the states (|46l) with the averaged 
coefficients (C") computed in Appendix [BJ 

In Fig.|4]we present the minimum error probability of 
the fully universal machine for the three priors discussed 
for an equal number of program and data states up to 
n — m — 26. As anticipated, the smaller average error 
corresponds to the Chernoff distance, because states with 
higher purity are assigned a larger weight, and these are 
easier to discriminate. The probability of error, as some- 
how expected, is inversely proportional to the number of 
copies, and attains very similar values than for the dis- 
crimination of states with fixed known purity of the order 
of r- 0.9. 



given by 



dpHS ^ 3^2^^ 



47r 



(73) 



The Bures distribution is far better motivated. It cor- 
responds to the volume element induced by the fidelity 
distance ^3 . It is monotonically decreasing under coarse 
graining [2^ and it has been argued that it corresponds 
to maximal randomness of the signal states [23 • In this 
case one has w(r) ex r^/Vl — i""^- Notice that this dis- 
tribution assigns larger weights to pure states, as their 
distinguishability in terms of the fidelity is larger than 
that of mixed states. The integration measure reads 



dp 



Bu 



TT VT 



zdr 



47r 



(74) 



Lastly, we also consider the recently proposed Chernoff 
distribution Q. It is the prior induced by the Chernoff 
distance, which has a clear operational meaning in terms 
of the distinguishability between states. By construc- 
tion it is monotonically decreasing under coarse graining. 
This measure assigns even larger weights to states of high 
purity and lower to the very mixed ones. This assignment 
is again based on the distinguishability properties, but in 
terms of the asymptotic behavior of the error probability. 
The measure can be written as \m 



dp 



Ch 



1 (\/TT7 - \/T"^) , dVL 
^^ ^— dr — 

47r 



7r-2 ^/Y~r^ 

The effective states we have to discriminate are 



(75) 



dp,dp2pTA®PtB®PtZik^l,2), (76) 



where dpk takes the expressions of the measures (|73ll75p . 
Note that the block structure of the states is the same as 
before, as it only depends on the permutation invariance 
of the input states, which remains untouched. Further, 



VII. CONCLUSIONS 

We have studied the problem of programmable dis- 
crimination of two unknown general qubit states when 
multiple copies of the states are provided. For pure states 
we have obtained the optimal unambiguous discrimina- 
tion and minimum-error probabilities, Eqs. (|S]) and (jlip . 
respectively. Some results along these lines can be found 
in [17], however no closed expressions were given there. 
Knowing the error in the asymptotic regimes is a very 
relevant information as it allows to assess and compare 
the performance of devices in a way which is independent 
on the number of copies. We have obtained analytical ex- 
pressions for the leading and sub-leading terms in several 
cases of interest. As could be anticipated, when the num- 
ber of copies at the program ports is asymptotically large 
at leading order we recover the average of the usual dis- 
crimination problem of known states Eqs. ([T7| and (|20p . 
When the data port is loaded with an asymptotically 
large number of copies, we recover the state comparison 
averaged error Eqs. (l24l) and ((27)) . These cases corre- 
spond to measure and discriminate protocols, where the 
measurement unveils the classical information about the 
states. 

We have addressed for the first time the programmable 
discrimination of copies of mixed states. We have ob- 
tained the minimum-error probability when the ports are 
loaded with copies of qubits of known purity, Eq. ([551) . 
We have assumed that all states have the same purity. 
This would, e.g., correspond to a scenario where all the 
initially pure data and program states are subject to 
the same depolarizing noise before entering the machine. 
Closed analytical results for a small number of copies can 
be obtained and efficiently computable expressions for a 
fairly large number of copies are given. The asymptotic 
analytical results show very good agreement with the 
numerics. The latter show a characteristic 1/A'^ depen- 
dence with the number N of available copies — in contrast 
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to the usual exponential decay found in standard (non- 
universal) state discrimination — and provide a very good 
approximation already for relatively low number of copies 
when the states have high purity. For very mixed states 
the error probability has a drastically different behavior. 
Logically, in both cases the error probability monotoni- 
cally decreases with increasing purity r, but in the low 
purity regime the dependence is much less pronounced. 
The range of purities exhibiting this behavior shrinks as 
the number of copies increases, and the characteristic 
1/A'' behavior of the asymptotic regime extends its valid- 
ity over almost the whole range of purities. 

Finally we have studied the fully universal discrimi- 
nation machine, a device that takes in states of which 
nothing is known, i.e., not even its purity. We compute 
the minimum error probability for three reasonable prior 
distributions of the purity: hard-sphere, Bures and Cher- 
noff (see Fig.|4]). The latter is seen to give the lowest error 
probability. This comes as no surprise, since the Chernoff 
distribution assigns larger weights to pure states (because 
they are better distinguished). Our results also indicate 
that the fully universal discrimination states yields an er- 
ror probability comparable to the discrimination of states 
of known purity, being that remarkably large (r ~ 0.9). 
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Appendix A: Arbitrary copies of pure states 

In this appendix we present the unambiguous discrim- 
ination and minimum error probabilities when the num- 
ber of copies nA,nB,nc loaded at the machine ports is 
completely arbitrary. Note that in this case the global 
states (Ti and (72 [cf- Eq. ^] may have different dimen- 
sions, for di = {uA + ns + l)("-c + 1) is in general not 
equal to d2 — (jia + 1)(?^_B + "-c + !)• One can eas- 
ily convince oneself that the support of the state with 
smallest dimension is always contained in the support of 
the other, and hence the problem can be solved in very 
much the same way as in the main text simply taking 
into account that the error probabilities now only con- 
tain contributions from the intersection of the supports. 
Without loss of generality we can assume from now on 



that UA > nc- As discussed in the main text, the error 
probabilities are computed by adding the pairwise con- 
tributions of the state bases in the common support, the 
main difference being that cji and tT2 do not have equal 
coefficients in front of the projectors and hence the prior 
probabilities of each pair of states are different. Also, 
the overlaps in Eq. ([B]) will have a slightly more compli- 
cated expression. Here we have ja = ^1^/2, Jb — ^iB/2, 
Jc = ncl^, JAB = (riA + "-s)/2 and jbc = {nB + nc)/2. 
The minimum J available for ai is jb + JA — jc = Jmin i 
and \JB + jc^ Ja\ = Jmin fo'" "'a- The maximum angular 
momentum ja + Jb + jc = Jmax is reachable for both 
states. For equal prior probabilities for ai and (T2, we 
can write 



2^^1=1^ Yl pj^][Jab;JM], (Al) 

J=J^^^ M=-J 

. ./max ./ 

9'^2= E E P./^jfcc;JM], (A2) 



J=J^ . M=-J 



where pj = I (^j- + j-^ tt) 



„ , , tTt = „ , for 

2pj d\ ' J 2pj 0,2 

Jlnin < J < Jmax, whcrcas P,j = ^, tt] = 0, tt} = 1 for 
Jmin 1^ J < Jmin ■ We cau view pj as the probability of 
projecting on subspace labeled by a given J and ill, and 
TTj, TTj as the prior probabilities that, on that subspace, 
the unknown state be [jab\JM] or [jbc]JM]. If the 
condition 



1 



^ ^AB ^ 



1 



(A3) 



where cj = \{Jab; JM\jbc;JM)\ is given by Eq. ©, 
holds, then the probability of obtaining an inconclusive 
answer when we finally discriminate between [jab ', JM] 
and [JBc;JM] is m P] = 2y/TT]njcj. If Eq. (jA3l is 
satisfied for J — Jmax — 1, then it will be satisfied all over 
this range of J, since cj is a monotonically increasing 
function of J. The overlap Cj has the very simple form 



UAUc 



{uA + nB){nB + nc)' 



(A4) 



Thus, Eq. (|A3[) is equivalent to 



nAUc 



< 



{uA + riB + l)(nc + 1) 

{ha + nB)(nB + nc) ~ [nB + nc + l)(nA + 1) 

{nA + nB){nB + nc) 



< 



nAnc 



,(A5) 



which is clearly true. Eq. (jA3l) does not hold if J = Jmax, 
for which we have Pj =1. Notice that since no error 
is made for Jmin ^ J < Jmin, fo^' ""j = ^, tl^^ total 
inconclusive probability reads P' ~ J2'j'Zy^ PJ C^J + 
l)Pj, which has the explicit expression 
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"A + riB — nc + k 
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y UB ) 
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where (iyiiB(7 = ny^+ns+nc+l. Note that when n^i = nc The minimuin error probabiHty can be computed en- 

the term proportional to cIabc disappears and the square tirely along the same lines. For a pair of states we 
root term simplifies, so we recover the closed form given ^ave [H Pf^ = 1 (^ ~ \/l - 4,n]7T^jc]) , and the total 

in the main text [cf. Eq. JSIl. , , .,-. \ ^ 

error probabinty reads 
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This expression coincides with Eq. (31) of |29| . 



Appendix B: Averaged C? coefficients 



Here we compute the average of the coefficients [see 
Eq. gl])] 
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for the hard sphere, Bures and ChernofF priors, 



For the hard sphere prior we have 
(q')HS=3 [c;^r^dr 



^^ r{n/2 + j + 2)r{n/2-j + l) ^ ^g^) 



T{n + 4) 
The Bures distribution yields 
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The averages for the Chernoff prior are a bit more in- 
volved, but still can be given in a closed form as 
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Ch 



TT-2,1 ^ 

2 

(^-2)(2j + 1) 



Cl^i^m^^EILdr 



3 

Efn fn+l-2m ri+l+2?ri^ 
L-^1/2 I 2 ' 2 J 



Eqs. (J73ll75p . considered in the fully universal discrim- where Bx{a,b) = /g t"'^^{l — t)''~^dt is the incomplete 



ination machine. 



beta function l30l. 
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